
Off-Policy General Value Functions to Represent Dynamic Role Assignments in RoboCup 3D Soccer Simulation


Abstract

Collecting and maintaining accurate world knowledge in a dynamic, complex, adversarial, and stochastic environment such as the RoboCup 3D Soccer Simulation is a challenging task, and knowledge must be learned in real time under tight time constraints. We use recently introduced off-policy gradient-descent algorithms within reinforcement learning to represent learnable knowledge for dynamic role assignments. The results show that the agents learned policies competitive against the top teams from the RoboCup 2012 competitions in three-vs-three, five-vs-five, and seven-vs-seven games. We explicitly used subsets of agents to identify the dynamics and the semantics for which the agents learn to maximize their performance measures, and to gather knowledge about different objectives, so that all agents participate effectively and efficiently within the group.
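The off-policy gradient-descent learners the abstract refers to belong to the gradient-TD family (e.g., GTD/GQ). As an illustrative sketch only, not the paper's implementation, the following shows one such method, TDC with per-decision importance sampling and linear (one-hot) features, evaluating a target policy from data generated by a different behavior policy. The toy 5-state chain task, step sizes, and policies are all hypothetical assumptions.

```python
import random

# Illustrative sketch (not the paper's code): off-policy policy evaluation
# with TDC, a gradient-TD algorithm, using per-decision importance sampling.
# Toy task: a 5-state chain; the target policy always moves right, while the
# behavior policy moves left/right uniformly at random.
N = 5          # number of non-terminal states
GAMMA = 0.9    # discount factor
ALPHA = 0.1    # primary step size
BETA = 0.05    # secondary (correction) step size

def phi(s):
    """One-hot feature vector for state s (tabular special case)."""
    f = [0.0] * N
    f[s] = 1.0
    return f

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

w = [0.0] * N  # value-function weights
h = [0.0] * N  # auxiliary weights for the gradient correction

random.seed(0)
for _ in range(20000):
    s = random.randrange(N)
    a = random.choice([-1, +1])          # behavior policy: uniform
    rho = 2.0 if a == +1 else 0.0        # rho = pi(a|s)/b(a|s); pi(right)=1, b(right)=0.5
    s2 = max(s + a, 0)                   # bounce off the left wall
    if s + a >= N:                       # right end: terminal with reward 1
        r, x2, done = 1.0, [0.0] * N, True
    else:
        r, x2, done = 0.0, phi(s2), False
    x = phi(s)
    delta = r + (0.0 if done else GAMMA * dot(w, x2)) - dot(w, x)
    hx = dot(h, x)
    for i in range(N):                   # TDC updates, each weighted by rho
        w[i] += ALPHA * rho * (delta * x[i] - GAMMA * x2[i] * hx)
        h[i] += BETA * rho * (delta - hx) * x[i]

# Under the target "always right" policy, the true value of state s
# is GAMMA ** (N - 1 - s), which w approaches despite off-policy data.
```

Because rho is zero for leftward actions, samples inconsistent with the target policy contribute nothing, which is exactly what lets the agents of the paper learn predictions about roles they are not currently executing.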

